add mamba causal-conv1d-update kernel #48

thoangtrvn · 2025-03-21T14:59:31Z

This is related to the prior PR: #47

This adds the second Triton kernel (decode stage) to be used in mamba-based model for the inference purpose.

lessw2020 · 2025-04-10T02:33:45Z

kernels/triton/inference/mamba/causal_1d_conv/causal_1d_conv/causal_1d_conv.py

+    cache_seqlens: Optional[torch.Tensor] = None,
+    conv_state_indices: Optional[torch.Tensor] = None,
+    pad_slot_id: int = PAD_SLOT_ID,
+):


you are returning o but it is not listed here in your function signature?

lessw2020 · 2025-04-10T02:34:25Z

kernels/triton/inference/mamba/causal_1d_conv/causal_1d_conv/causal_1d_conv.py

+            for example: cache_indices = [pad_slot_id, 1 ,20 ,pad_slot_id]
+            in this case, the kernel will not process entries at
+            indices 0 and 3
+    out: (batch, dim) or (batch, dim, seqlen)


also inconsistent - out? (vs o)

lessw2020 · 2025-04-10T02:34:51Z

kernels/triton/inference/mamba/causal_1d_conv/causal_1d_conv/causal_1d_conv.py

+        conv_state_indices=conv_state_indices,
+        pad_slot_id=pad_slot_id,
+    )
+    return o


nit but not a fan of using o by itself... out or output etc. makes it more clear imo.

lessw2020 · 2025-04-10T02:36:07Z

Hi @thoangtrvn - thanks for the update and sorry for the delay!
I had a question re: the lack of return signature as well as inconsistent naming (out, then o) which are minor but would be nice to clarify it.
Otherwise looks good!

thoangtrvn · 2025-04-10T16:56:43Z

Thanks @lessw2020 , I'll update base on your feedback.

tmhoangt added 2 commits March 21, 2025 10:54

add 2nd kernel causal conv 1d update (decode)

380129e

Merge remote-tracking branch 'upstream/main'

eef52f9

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Mar 21, 2025

lessw2020 reviewed Apr 10, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add mamba causal-conv1d-update kernel #48

add mamba causal-conv1d-update kernel #48

thoangtrvn commented Mar 21, 2025 •

edited

Loading

lessw2020 Apr 10, 2025

lessw2020 Apr 10, 2025

lessw2020 Apr 10, 2025

lessw2020 commented Apr 10, 2025

thoangtrvn commented Apr 10, 2025

add mamba causal-conv1d-update kernel #48

Are you sure you want to change the base?

add mamba causal-conv1d-update kernel #48

Conversation

thoangtrvn commented Mar 21, 2025 • edited Loading

lessw2020 Apr 10, 2025

Choose a reason for hiding this comment

lessw2020 Apr 10, 2025

Choose a reason for hiding this comment

lessw2020 Apr 10, 2025

Choose a reason for hiding this comment

lessw2020 commented Apr 10, 2025

thoangtrvn commented Apr 10, 2025

thoangtrvn commented Mar 21, 2025 •

edited

Loading